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Abstract 

Polynomial time reductions between problems have long been used to delineate 
problem classes. Simulation reductions also exist, where an oracle for simulation from 
some probability distribution can be employed together with an oracle for Bernoulli 
draws in order to obtain a draw from a different distribution. Here linear time simula- 
tion reductions are given for: the Ising spins world to the Ising subgraphs world and the 
Ising subgraphs world to the Ising spins world. This answers a long standing question 
of whether such a direct relationship between these two versions of the Ising model 
existed. Moreover, these reductions result in the first method for perfect simulation 
from the subgraphs world and a new Swendsen-Wang style Markov chain for the Ising 
model. The method used is to write the desired distribution with set parameters as a 
mixture of distributions where the parameters are at their extreme values. 

1 Introduction 

In this paper, two different forms of the Ising model are linked via a polynomial time 
simulation reduction. Consider a family of probability distributions vr parametrized 
by inputs I, together with another family vr'. Say that tt is simulation reducible to vr' 
if an algorithm exists for drawing from vr on T that is allowed to take draws from vr' 
as well as utilize extra randomness in the form of independent Bernoulli draws whose 
parameter is determined at runtime. In other words, if the ability to draw from vr' 
together with the ability to flip coins with arbitrary probabilities of coming up heads is 
enough to draw a sample from vr, then vr is simulation reducible to vr'. As with problem 
reductions, sampling reductions are of most interest when they are polynomial, that 
is, when the sum of the sizes of the inputs to vr' plus the number of Bernoullis used is 
a polynomial in the size of I. 

A famous example of simulation reductions appears in the Swendsen-Wang ap- 
proach [9] to the Ising model. The original version of the Ising model used spins on 
nodes in graphs (call this the spins view) while another used subsets of edges in graphs 
(call this the random cluster view). As part of the Swendsen-Wang algorithm, a draw 
from the random cluster model could be turned (using a number of Bernoullis equal 
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to the number of clusters formed by the edges) into a spins draw. Similarly, a spins 
draw could be turned (using a number of BernouUis equal to the number of edges in 
the graph) into a random cluster draw. 

In this work, a third view of the Ising model is dealt with. This third view also 
uses subsets of edges, but assigns them weights that are very different from those of 
the random cluster model. Following [3], this third model will be called the subgraphs 
view of the Ising model. 

The families of distributions corresponding to spins, random clusters, and subgraphs 
will be denoted TTspinsi TTrc, and TTgubs respectively. The remainder of the paper is 
organized as follows. In Section 2, the three families of the Ising model are described 
in detail. Section 3 discusses the framework that will be used for the reductions. 
Section 4 shows how a single draw from the subgraphs model together with a number 
of Bernoullis at most equal to the number of edges can be used to create a draw from 
the random cluster model. Section 5 then shows the reverse direction: how a single 
draw from the random cluster model together with a number of Bernoullis equal to the 
number of edges can be used to draw from the subgraphs model. Since the reductions 
between TTrc and TTgpins are well known, and require a single sample each, it is possible 
to take a single draw from TTgubs and convert it to a single draw from TTspins with a small 
number of Bernoullis. Similarly, a single draw from TTgpins can be converted to a single 
draw from TTgubs with a small number of Bernoullis. This yields a new Swendsen-Wang 
style Markov chain for approximately sampling from the Ising model. Also, it answers 
a long standing question in [3] of whether or not such a direct relationship between the 
distributions TTgping and TTgubs existed. These results are discussed in detail in Section 6. 

Earlier work in [7] created an approximate simulation reduction from TTgping to TTgubs 
by creating a series of approximations of partition functions for multiple inputs. This 
method has the drawback of requiring multiple draws from TTgubs in order to obtain a 
single configuration that is approximately drawn from TTgping. In contrast, the reduction 
presented here is exact, and requires only one draw from TTgubs to generate one draw 
from TTgpins or vice versa. Because the reduction is exact, this provides for the first 
perfect simulation algorithm for TTgubs; the details are also in Section 6. 

2 The Ising model 

The Ising model was initially proposed as a model for magnitization and has been 
studied extensively becuase of the presence of a phase transition in lattices of dimen- 
sion two or higher [8]. It has also been employed in statistical applications including 
agricultural studies and image restoration [1]. 

Spins model In the ferromagnetic Ising model each node of a graph G = (V, E) is 
assigned either a 1 or -1. Because of the original use as a model of magnetism, the 1 

nodes are referred to as "spin up" and the -1 nodes are called "spin down". 

A state X G {—1,1}^ is called a configuration. The weight of a configuration is of 
the form: 

W^spins(a;) = Yi /(^(^)'^(i)) 
{i,j}eE 
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For a weight function i/^spinsCs^); the probabihty distribution is TTspinsC^^) = '*^spins(3^)/-^s 



-'spins 5 



where ^spins = Y2xe{-1 1}^ ^spins(2;) is called the partition function. 

Let P he a nonnegative vector over the edges E, and be a nonnegative vector 
over the nodes V. Then 



f{x{{),x{j)) 



exp{P({i,j})x{i)x{j)) 
l{x{i) = x{j)) 



P{{i,j}) < oo 



(1) 



Hence (3 controls the strength of interaction between spins at endpoints of edges. When 
P is infinite, the attraction is infinite, and endpoints must be given the same value. 

An extension of the basic Ising model is to allow for an external magnetic field. In 
this case, the weight function acquires factors based directly on the value of each node: 



'i^spins+field (^) 



where g is paramerized by a vector B that controls the strength of the magnetic field: 



cxp(B{i)x{i)) 
l{x{i) = 1) 
= -1) 



for 
for 
for 



\B{i)\ < oo 
B{i) = oo 
= -oo 



Say that the magnetic field is unidirectional if B > or B < 0. The unidirectional 
model is easier than the model with general magnetic field in the sense that it can be 
easily reduced to the case with no magnetic field as follows. Suppose without loss of 
generality B >0. 

If B{v) < oo for all v, then build a new graph by creating a dummy node vb and 
for all i E V adding an edge {i, us} with edge weight P{{i,VB}) = {l/2)B{i). Note 
that any configuration x satisfying x{vb) = 1 has 



w, 



spins + field (a; ("V^)) = lt^spinsonly(x(y)) JJ exp((l/2)B(z)). 

ie^:-B(i)<C!0 



Since the weights differ by only a constant, the probability distributions are the same. 

Moreover, under Wspins only the weight of x and —x are the same, so taking a draw 
from Wgpins and choosing x or —x so that vb has value 1 is an easy way to draw 
from u;spins conditioned on being 1. Hence the unidirectional magnetic field can be 
brought into the case of no magnetic field with no loss of generality when B{v) < oo 
for all V. 

When there exist one or more nodes with B{v) = oo, there is no need to create the 
dummy node vb- These nodes all must be spin up, and so they can be merged, and 
then the merged node can be used in the same fashion as above. 



Subgraphs model In a seminal paper Jerrum and Sinclair [3] showed how to approx- 
imate .^spins(/5) for any graph and any /3 in polynomial time. Their approach did not 
tackle the spins model directly. Instead, they developed results for a different formu- 
lation of the Ising model: the subgraphs model. This model is also a Markov random 
field, but assigns values to the edges of the graph rather than the nodes. 
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Let figubs = {0) 1}^) and now a configuration y G rigubs can be viewed as a collection 
of edges of the graph, where y(e) = 1 indicates that the edge is in the collection and 
otherwise it is not. Since each y encodes a subset of edges, this was called the subgraphs 
world in [3]. Define A over the edges as follows: 

A({i,j}) =tanh/3({i,i}) (2) 

where tanhoo is taken to be 1. Then in the subgraphs model, the weight function is 



w^subs(y) 



n ^(e) 

e:y(e)=l 



]^l(deg(i,y) is odd) 



The degree of a node i in subgraph configuration y is 

deg{i,y) ■=^y{{i,j}), 



(3) 



(4) 



and 1 (A) is the indicator function that has value 1 if the Boolean expression A is true, 
and is otherwise. 

As with the spins model, 7rsubs(y) = ^subs(y)/-^subs) where .^subs is the sum of the 
weights over all configurations, or equivalently the normalizing constant that makes 
TTsubs a probability distribution. The subgraphs model is also known as the high tem- 
perature expansion, and was introduced by van der Waerden [10]. Like the spins model, 
here the weight depends on the product of individual factors that depend either on a 
single edge or only on the edges leaving a particular node. 

One note: in the Jerrum and Sinclair [3] paper the formulation of the subgraphs 
world included terms that allowed for a unidirectional magnetic field. As noted in the 
previous section, a unidirectional field can be eliminated from the problem without loss 
of generality, and so here the simpler form of the subgraphs model is used. 



Random Cluster model A third approach to the Ising model was introduced by 
Fortuin and Kasteleyn [2]. Unlike the spins and subgraphs views of the Ising model, 
the random cluster model is decidedly nonlocal in its weight function. 

Like the subgraphs model, the state space Q^c = {0, 1}^ indexes a collection 
of edges. These edges partition the nodes V into the set of maximally connected 
components, known as clusters. To be precise, consider a configuration z G $7^- 
A collection of nodes C is a cluster in z if for all v and v' in C, there is a path 
{v = vi,V2,V3, . . . ,Vn = v'} such that {wjjWj+i} G E and z{{vi,Vi+i}) = 1 for all i. 
Moreover, for all w G C and v" ^ C, either {v, v"} ^ E or z{{v, v"}) = 0. Let C denote 
the set of clusters. 

The weight function for the random cluster model is: 



Wrc{z) 



n p(^) 

e:z{e)=l 



n i^-pi^)) 

e:z{e)=0 



2#C. 



where 



p(e) = l-exp(-2/3(e)). 



(5) 
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and exp(— oo) is taken to be 0. 

The procedure for generating a spins draw from a random cluster draw is as follows: 
independently for each cluster, draw uniformly from {—1, 1} and assign all nodes in 
that cluster the randomly chosen value. (See [9] for further details.) 

The idea behind this is as follows. The spins model with parameter (3 can be viewed 
as a mixture of many different spins models, each with a different set of parameters. 
Each component of the mixture has a parameter vector whose {3 values for every edge 
are either or oo, and so altogether there are 2*^ different components of the mixture. 
The random cluster configuration indexes which component of the mixture to use: 
z{e) = 1 indicates (3{e) = oo, while z{e) = indicates /3(e) = 0. Drawing a spins 
model with such a simple parameter vector is easy: all nodes connected by edges with 
/3(e) = oo must have the same spin, while the /3(e) = edges might as well be gone 
from the graph. 

This idea of using an index from a mixture is developed formally in Section 3. 

Relationship between spins and subgraphs The normalizng constants .^spins and 

-^subs are related by an easy to calculate constant. The following result goes back 
to [5]. 

Theorem 1. 

■^spins ~ 

e 

While this result is straightforward to show analytically, such a proof does not offer 
much probabilistic insight into why this remarkable relationship is true. In [3] it is noted 
that "there is no direct correspondence between configurations in the two domains and 
the subgraph configurations have no obvious physical significance" . In Section 6 a new 
proof of this result is presented that is based on the simulation reductions presented 
here. 

In [3], a Markov chain that moved among configurations in the subgraphs world 
was developed. More importantly, this chain was shown to be rapidly mixing for all 
graphs. This result could then be used with the idea of selfreducibility [4] to obtain a 
fully polynomial approximation scheme (fpras) for .^subsi and hence .^spins as well. 

3 Drawing from mixtures 

The reductions between spins and subgraphs draws come from viewing a distribution 
TT as a convex mixture of several distributions, so 

TT = aiTTl H omttm, 

where ai + ■ ■ ■ au = 1- 

Typically the parameters for distributions of interest (like the Ising model) form 
a convex set. The mixture of tt uses distributions where the parameter values are at 
their extreme values. Usually these extreme values of parameters correspond to simpler 
distributions. Many times, in reducing from tt to vr', vr' = aiTr^ + • • • umt^'m where the 
ai are equal to the cofficients for tt. Therefore, the algorithm proceeds as follows: 



5 



(1) Draw a sample from tt, 

(2) Choose which tt^ the sample came from, 

(3) Draw a sample from it'-, 

(4) Return as a sample from tt'. 

This can be formulated as a special case of the auxilliary variable method. Let / be 
a random variable where P(/ = i) = ctj (this extra variable / is the auxilliary variable.) 
If X\I ~ TT/, then X IT. The algorithm takes advantage of this in reverse: Given 
X ^ TT begin by choosing I\X. Then choose X'\I from tt'. Then X' ~ tt' as desired. 

Typically, direct computation of the ccj is difficult, which is why it is not possible 
to just draw a random I directly. However, drawing I\X is possible, by proceeding 
through several stages. 

Begin by considering a mixture of two distributions, so tt = aivri + a27r2. At this 
first stage, choose /(I) G {1,2} given X. Then break 7r7(i) into two distributions, and 
so on until the index / = (J(l), J(2), . . . ,I{k)) has been chosen. Typically the stages 
are set up so that tt'j is easy to sample from. The following theorem provides the 
algorithm for choosing I{a) given X from the two choices. 

Theorem 2. Consider three distributions n, tti and 7r2, respectively defined by unnor- 
rn,alized weight functions w, wi and W2 with the appropriate normalizing constants Z = 
J2y ^"iy)^ ^1 = IZyWiiy), andZ2 = Yjy'^"-2{y)- Suppose thatw{x) = ciWi{x)+C2W2ix), 
where the Ci are positive constants. 

Then tt is a mixture o/vri and tt2, so tt = aivri + Q;2'7r2. Let X ^ tt. Let I\X be a 
binary random variable that is 1 with probability ciWi{X)/[ciWi{X) + C2W2{X)\ and is 
2 otherwise. Then P(/ = 1) = ai, P(/ = 2) = a2, and [X\I] ~ tt/. 

Proof. Start with w{x) = ciwi{x) + C2W2{x) and divide through by Z. This yields 
7r(x) on the left hand side. Multiply and divide term i on the right hand side by Zi to 
obtain: 

7r(x) = ClTTl{x){Zi/Z) + C2Tr2{x){Z2/Z). 

Hence ai = ciZi/Z and a2 = C2Z2/Z. Also, 



a,; 



y y ^ ' y 

Turning it around: 

¥{X = y\I = i)= P(7 = i\X = y)Tr{y) ai = — = — — . 

w{y) Z CiZi Zi 

So = i] has distribution tTj, completing the proof. □ 

This theorem can be easily extended to an arbitrary number of distributions, how- 
ever, mixtures of tti and tt2 suffice for all the results in this paper. 
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4 Subgraphs to random cluster 



In this section it is shown how to utiUze a subgraphs draw together with at most jj^E 
BernoulU draws to generate a draw from the random cluster model. This can then be 
used to create a spins draw if desired. 

Let VTsubs be the distribution parameterized by A. Consider any edge e with A(e) G 
(0, 1). Create a new vector Ae->i that takes on value 1 on edge e, and matches A at all 
other edges. Similarly, let Ae->o equal A on all edges but e, and let Ae-»o(e) = 0. 

Then since A(e) G (0,1): 

Wsvihsiv; A) = A(e)u;subs(2/; + (1 - A(e))'u;subs(y; Ae-*o), (7) 

This has the exact form needed for Theorem 2. The algorithm is simple. Begin 
with a draw Y from vr parameterized with A. Calculate A(e) times the weight of Y 
under Ae~^i, and (1 — A(e)) times the weight of Y under Ag^o- Draw index / from 
{1,0} with probability proportional to these two numbers, then set A(e) to /. 

This means P(/ = 1) = A(e)u;subs(^; Ae^i)/ifsubs(^; A). When Y{e) = 1, this is 
just 1. When Y{e) = 0, this becomes A(e). Therefore 

P(7 = 1) = A(e) + (l-A(e))F(e). 

After choosing the value of /, Y given I comes from the correct half of the mixture, 
and so the process can be repeated again, until all of the entries in A are either or 1. 

Algorithm 1 Reduce edge weights 
Input: parameter A, y ~ 7rsubs(s A)). 
Output: new edge parameters A 

1: for all edges e with A(e) G (0, 1) do 

2: draw A(e) ^ Bern[A(e) + (1 - A(e))F(e)] 

3: end for 



Note that when Y{e) = 1, then \{e)wsuhsiY; Xe^i) / WguhsO^] ^) = 1; so the algo- 
rithm always sets A(e) to 1 in this case. Only when Y[e) = is there a choice with the 
possibility of A(e) = 0. 

At the end of the this algorithm, the draw Y has been slotted into one of 
components of a mixture indexed by the new parameter vector A. Each clement of A is 
now either or 1, in the spins world this corresponds to f3 being either or oo. Hence 
the two nodes are either independent of each other or forced to be the same, just as in 
the random cluster model. 

This means, if the random output of Algorithm 1 is called A, then A is a draw from 
the random cluster model of Fortuin and Kasteleyn [2] . This is stated precisely in the 
following theorem. 

Theorem 3. Let Y ~ vrsubs(s A). Then let A be the output of Algorithm 1 with input 
(A,y). Zei^)(e) = l-exp(-2/?(e)) =2A(e)/(l + A(e)). Then 



P(A = z) 



1 



n 



n i-^'(^) 



2#C(^) 



e:z{e)=l 



e:z{e)=0 
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where 



Zrc = ^subs2#^-#^ 11(1 + exp(-2/3(e)). 



Proof. Fix z G {0,1}^. Consider a particular cluster in z. Suppose that Y = y, 
and consider the chance that A = z given Y = y. Edges with y(e) = 1 always have 
A(e) = 1, so if z{e) = then y{e) must be zero to have positive chance of A = z. That 
is, y < z. 

Now consider how a random draw could result in A = z given Y = y. For each 
edge with y{e) = and z{e) = 1, there is a A(e) chance of making this choice. Hence 
the chance these edges agree is ne-2/(e)=o z{e)=i H^)- -^^^ ^^^^ edge e with y{e) = 1 and 
z{e) = 1, the chance these edges agree is 1. Finally, if y{e) = and z{e) = 0, the 
chances that these agree is Y\e-y(e)=o z{e)=o ^ ~ -^(^)- Therefore, 



P(A = z\Y = y) 



n ^(«) 



e:y{e)=0,z{e)=l 



n 



1 - \{e) 



e:y{e)=0,z{e)=0 



Now consider the probability that Y = y. Since y is a subgraphs draw, this 
probability is unless all the nodes under y have even degree, in which case the 



probability of choosing y is Z^^^^^W ,.^\{e). This equals Z^^^^^W 



Vy{e)=l,z{e)=l 



smce 



y(e) = 1 implies z{e) = 1 as well. Therefore for all y where each node has even degree 



P(F = y,A = z) 



"'subs 



2-1 

subs 



n ^(^-) 

e:y(e)=l,z{e)=l 



n ^i") 

e:y(e)=0,z{e)=l 



n 1 - ^(^) 

e:y{e)=0,z{e)=0 



n ^(^) 

e:z{e)=l 



n l-A(e) 

e:z(e)=0 



To find P(A = z) all that remains is to sum over all y where each node has even degree. 
Since F{Y = y,A = z) is independent of y, 



P(A = z) = #{y:y<z, deg(i, y) even VijZj^^ 



n 

e:z{e)= 



Me) 



n 

e:z{e)= 



l-A(e) 



(8) 

So now consider how many y < z states have even degree everywhere. Consider a 
particular cluster C in z, and let Ce denote the set of edges between nodes in C. Since 
the nodes are connected, there exists a tree T with edges Te C Ce- Let A = Ce \ Te 
he those edges in the cluster that are not part of the tree. 

The key fact about A is that for any value of y{A) there exists exactly one value of 
y{TE) that ensures that y{TE U ^4) has even degree at each node. See Figure 1 for an 
example. 

To show this, fix y{A). Start with a leaf i of the tree T. There is exactly one edge 
in Te adjacent to i with z{e) = 1, so there is precisely one way to choose y(e) in order 
to maintain that the degree of i must be even under y{e). Now consider the remaining 



8 




Ce — {ci, 62, ^1, ^2, ^3) ^4, ^5} 
Te — {tl, t2, ^3, t^, t^} 

y({ei, 62}) determines yiTs) 



Figure 1: Example of cluster with six nodes 



edges in Te- The nodes connected to these edges forms a smaller tree, so again a leaf 
exists, and again the there is exactly one way to choose x on this edge to maintain the 
even degree requirement. After #T steps the unique value for y{T) that preserves the 
even degree at each node will have been found. 

So given y{A), there is exactly one choice of y over the edges of the cluster where 
all edges of the cluster receive even degree. There are 2*"^ different possible values of 
y{A), and so there are 2*"^ different choices of y over the cluster. The number of edges 
in A is the number of edges in the cluster minus the number of edges in the tree, so 
for a cluster C E C, this is 



#A 



eeCs 



-[#c-i]. 



The choice of y{CE) is independent for each cluster C G C, and so the total number 
of X such that x < z and each node of x has even degree is the product 

CgC 

Combining this with (8) yields 



or equivalently, 



n 2A(e) 

e:2(e)=l 



n i-Me) 

e:2(e)=0 



= -) = ^subs2-*^ 



2*c{z) 



ii 1 + A(e) 

e:2(e)=l 



n \ 



A(e) 



e:2(e)=0 



+ A(e) 



It is easy to verify from (2) and (5) that 2A(e)/(l + A(e)) = p(e) and (1 - A(e))/(1 + 
A(e)) = 1 — p(e), which shows that the two rightmost sets of brackets factors equal 
w,,{z). Therefore Z;^^*'' W^{\ + A(e)) = Z'^K Using (1 + A(e))-i = (1/2)(1 + 
exp(— 2/3(e))) completes the proof. □ 
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5 Random cluster to subgraphs 



This section shows how to take a random cluster draw, and together with a number 
of Bernoullis equal to H^E — ^V, create a draw from the subgraphs model. Swendsen- 
Wang [9] employed the relationship between random clusters and spins to devise a 
Markov chain that is very fast for some values of the parameters. From a spins draw, 
generate a random cluster index, then from the index, generate a new spins draw. 

With the ability to move between a random cluster draw and a subgraphs draw, a 
new Swendsen-Wang style Markov chain becomes available for use: from the random 
cluster model, draw a subgraphs model, then from the subgraphs model, draw a random 
cluster model. Algorithm 2 shows how to move from a random cluster model to a 
subgraphs model. 

Algorithm 2 Random Cluster to subgraphs 
Input: parameter p(e), Z ~ 7rrc(-;p) 

Output: Y ~ TTsubs 

1: Fe <— a maximal forest using edges e with Z{e) — 1. 
2: 

3: for all edges e e E \ Fe do 

4: draw Y{e) ^ Bern(Z(e)/2), D ^ D U {e} 

5: end for 

6: while Fe 7^ do 

7: i any leaf in the forest Fe 

8: j <— the node such that {i,j} G Fe 

9: ^ l(deg(i,F(D)) is odd ) 

10: D^DU{t,j},FE^FE\{t,j} 

11: end while 



Theorem 4. Algorithm 2 returns a draw from the subgraphs distribution where pa- 
rameters X, (3 and p are related by equations (2) and (5). 

Proof. It was shown in Theorem 3 that if K 7i"subs('; X), and then Z\Y' is drawn as an 
index for the component of the mixture, then Z iTj-c{-;p) where p and A are related 
by (2) and (5). So as in the discussion in Section 3, if Z is drawn first from 7rrc(-;p)) 
and then Y\Z ~ 7rsubs(-; Z), then Y ~ TTsubsCs A). 

Let Fe be a maximal spanning forest in the graph using edges with Z{e) = 1. Any 
edge not in Fe with Z{e) = must have Y{e) = in order for the configuration to 
have positive weight. So let e be an edge such that Z(e) = 1 in \ Fe- 

Since Fe is a maximal forest, the endpoints of e must be connected by edges in Fe- 
Call e plus these connecting edges L. Let x be any configuration with x{e) = 0, and 
f{x) be a new configuration constructed as follows. For all e' € L, let /(e') = 1 — x{e'). 
For all e' ^ L, let /(e') = x{e')- This 'flips' the value of x(e') along all the edges of the 
cycle eU L. 

Since the edges are flipped along a cycle, the parity of each node is unchanged. 
This map / is 1-1 and onto, and so the partition function conditioned on y(e) = 
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equals that conditioned on y{e) = 1. In other words, there is a exactly a 1/2 chance 
that Y{e) = 1 for a draw Y ~ VTsubs- 

This holds even when conditioned on the values of all other edges in E \ {Fe U {e}), 
since only edges in Fe plus e were used to construct L. Combining the Z{e) = and 
Z{e) = 1 cases, the result is that Y{e) ^ Bern(2^(e)/2), even when conditioned on the 
value of Y{e') for all other edges e' e not in Fe- Hence lines 3 through 5 of the 
algorithm are correct. 

Now consider edge e = where i is a leaf of Fe- Then Y{e) must be chosen 

in such a way that the degree of i in Y is even. Line 9 accomplishes this task. This 
edge has been assigned a value, and can now be removed from Fe- As long as Fe is 
nonempty, there will be at least two leaves left, so the process can be continued until 
the edges of Fe are all assigned values by Y- □ 

Note that the maximal forest Fe can be constructed using either breadth first or 
depth first search. Either way, reversing the order in which nodes were added to the 
forest provides a way of recovering the leaves one by one with an overall running time 
that is linear in the size of the graph. 



6 Further notes 



6.1 Proof of Theorem 1 

The reduction from subgraphs to random clusters as shown in Theorem 3 together with 
the reduction from random clusters to spins provides an immediate proof of Theorem 1. 

Proof of Theorem 1- Let Z he a random cluster draw, and X the spins draw that is 
obtained by uniformly at random choosing spin up or down for each cluster in Z- 
For a spins configuration x, let S{x) denote the set of random cluster configurations 
that are consistent with a;, so 2; G S{x) means for all £ E, x{i) = x{j) implies 

z{{i,j}) = 1. Then 

F{X = x) = F{X = x\Z = z)F{Z = z) 

zeS(x) 



n P(^) 

e:z(e)=l 



n i^-pi^)) 

e:z{e)=0 



^ (1/2)#CW 
^rc'n E p(eMe) + (l-p(e))(l-z(e)) 

e z{e)<l(x{i)=x(j)) 



2#c(.)^-i 



Now 1 — p{e) = if /3(e) = cxd. When (3{e) < 00, 1 — p{e) = exp(— 2/3(e)). This means 
that for all /3(e): 



l{x{i} = x{j)) + (1 -p{e))lix{i) + xU)) = /(x(z),x(i))exp(-/3(e)) 
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where exp(— oo) is taken to be 0, and / is as in (1). Hence 

¥{X = x) = Z-,'l[[f{x{i),x{j))expi-P{e))], 



which means that Zspms = -^rc Tie ^^P(/^(^))- 
Combining with Theorem 3 yields 



■^spins — -^subs 



n(l + exp(-2/3(e))) 



nexp(/3(e)) 



as desired. 



□ 



6.2 Perfect simulation of subgraphs 

One of the original applications of the coupling from the past algorithm [6] was to 
generate samples perfectly from the random cluster model. Given the reduction of 
the previous section, this immediately gives the first perfect simulation method for the 
subgraphs world, which could prove useful in studying the model. 
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